University of Minnesota Ph.D. dissertation. July 2017. Major: Biostatistics. Advisors: Julian Wolfson, Wei Pan. 1 computer file (PDF); x, 85 pages.When dealing with high-dimensional data, performing variable selection in a regression model reduces statistical noise and simplifies interpretation. There are many ways to perform variable selection when standard regression assumptions are met, but few that work well when one or more assumptions is violated. In this thesis, we propose three variable selection methods that outperform existing methods in such "messy data'' situations where standard regression assumptions are violated. First, we introduce Thresholded EEBoost (ThrEEBoost), an iterative algorithm which applies a gradient boosting ...
This publication is with permission of the rights owner (Sage) freely accessible.We present a new pr...
Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/135979/1/sta4133_am.pdfhttp://deepblue....
With the prevalence of high dimensional data, variable selection is crucial in many real application...
<p>Most variable selection techniques for high-dimensional models are designed to be used in setting...
Modern applications of statistical approaches involve high-dimensional complex data, where variable ...
Background Modern biotechnologies often result in high-dimensional data sets with many more varia...
Variable selection and model choice are of major concern in many statistical applications, especiall...
Variable selection plays an important role for the high dimensional data analysis. In this work, we ...
Variable selection and estimation for high-dimensional data have become a topic of foremost importan...
From the perspective of econometrics, an accurate variable selection method greatly enhances the rel...
Understanding high-dimensional data has become essential for practitioners across many disciplines. ...
In biomedical research, boosting-based regression approaches have gained much attention in the last ...
We present a new variable selection method based on model-based gradient boosting and randomly permu...
University of Minnesota Ph.D. dissertation. July 2018. Major: Biostatistics. Advisors: Julian Wolfso...
Multivariate analysis is a common statistical tool for assessing covariate effects when only one re...
This publication is with permission of the rights owner (Sage) freely accessible.We present a new pr...
Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/135979/1/sta4133_am.pdfhttp://deepblue....
With the prevalence of high dimensional data, variable selection is crucial in many real application...
<p>Most variable selection techniques for high-dimensional models are designed to be used in setting...
Modern applications of statistical approaches involve high-dimensional complex data, where variable ...
Background Modern biotechnologies often result in high-dimensional data sets with many more varia...
Variable selection and model choice are of major concern in many statistical applications, especiall...
Variable selection plays an important role for the high dimensional data analysis. In this work, we ...
Variable selection and estimation for high-dimensional data have become a topic of foremost importan...
From the perspective of econometrics, an accurate variable selection method greatly enhances the rel...
Understanding high-dimensional data has become essential for practitioners across many disciplines. ...
In biomedical research, boosting-based regression approaches have gained much attention in the last ...
We present a new variable selection method based on model-based gradient boosting and randomly permu...
University of Minnesota Ph.D. dissertation. July 2018. Major: Biostatistics. Advisors: Julian Wolfso...
Multivariate analysis is a common statistical tool for assessing covariate effects when only one re...
This publication is with permission of the rights owner (Sage) freely accessible.We present a new pr...
Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/135979/1/sta4133_am.pdfhttp://deepblue....
With the prevalence of high dimensional data, variable selection is crucial in many real application...